OCR exploration server

Warning: this site is under development.
Warning: this site was generated automatically from raw corpora.
The information has therefore not been validated.

Lessons for the Future from a Decade of Informedia Video Analysis Research

Internal identifier: 001283 (Main/Exploration); previous: 001282; next: 001284

Authors: G. Hauptmann [United States]

Source :

RBID : ISTEX:8EAEE8462A4CFC7C03932537AF5AAD784FEE76B5

Abstract

The overarching goal of the Informedia Digital Video Library project has been to achieve machine understanding of video media, including all aspects of search, retrieval, visualization and summarization in both contemporaneous and archival content collections. The base technology developed by the Informedia project combines speech, image and natural language understanding to automatically transcribe, segment and index broadcast video for intelligent search and image retrieval. While speech processing has been the most influential component in the success of the Informedia project, other modalities can be critical in various situations. Evaluations done in the context of the TRECVID benchmarks show that while some progress has been made, there is still a lot of work ahead. The fundamental “semantic gap” still exists, but there are a number of promising approaches to bridging it.

Url:
DOI: 10.1007/11526346_1



The document in XML format

<record>
<TEI wicri:istexFullTextTei="biblStruct">
<teiHeader>
<fileDesc>
<titleStmt>
<title xml:lang="en">Lessons for the Future from a Decade of Informedia Video Analysis Research</title>
<author>
<name sortKey="Hauptmann, G" sort="Hauptmann, G" uniqKey="Hauptmann G" first="G." last="Hauptmann">G. Hauptmann</name>
</author>
</titleStmt>
<publicationStmt>
<idno type="wicri:source">ISTEX</idno>
<idno type="RBID">ISTEX:8EAEE8462A4CFC7C03932537AF5AAD784FEE76B5</idno>
<date when="2005" year="2005">2005</date>
<idno type="doi">10.1007/11526346_1</idno>
<idno type="url">https://api.istex.fr/document/8EAEE8462A4CFC7C03932537AF5AAD784FEE76B5/fulltext/pdf</idno>
<idno type="wicri:Area/Istex/Corpus">001380</idno>
<idno type="wicri:Area/Istex/Curation">001300</idno>
<idno type="wicri:Area/Istex/Checkpoint">000B92</idno>
<idno type="wicri:doubleKey">0302-9743:2005:Hauptmann G:lessons:for:the</idno>
<idno type="wicri:Area/Main/Merge">001319</idno>
<idno type="wicri:Area/Main/Curation">001283</idno>
<idno type="wicri:Area/Main/Exploration">001283</idno>
</publicationStmt>
<sourceDesc>
<biblStruct>
<analytic>
<title level="a" type="main" xml:lang="en">Lessons for the Future from a Decade of Informedia Video Analysis Research</title>
<author>
<name sortKey="Hauptmann, G" sort="Hauptmann, G" uniqKey="Hauptmann G" first="G." last="Hauptmann">G. Hauptmann</name>
<affiliation wicri:level="4">
<country xml:lang="fr">États-Unis</country>
<wicri:regionArea>School of Computer Science, Carnegie Mellon University, 15213, Pittsburgh, PA</wicri:regionArea>
<placeName>
<region type="state">Pennsylvanie</region>
<settlement type="city">Pittsburgh</settlement>
</placeName>
<orgName type="university">Université Carnegie-Mellon</orgName>
</affiliation>
<affiliation wicri:level="1">
<country wicri:rule="url">États-Unis</country>
</affiliation>
</author>
</analytic>
<monogr></monogr>
<series>
<title level="s">Lecture Notes in Computer Science</title>
<imprint>
<date>2005</date>
</imprint>
<idno type="ISSN">0302-9743</idno>
<idno type="eISSN">1611-3349</idno>
<idno type="ISSN">0302-9743</idno>
</series>
<idno type="istex">8EAEE8462A4CFC7C03932537AF5AAD784FEE76B5</idno>
<idno type="DOI">10.1007/11526346_1</idno>
<idno type="ChapterID">1</idno>
<idno type="ChapterID">Chap1</idno>
</biblStruct>
</sourceDesc>
<seriesStmt>
<idno type="ISSN">0302-9743</idno>
</seriesStmt>
</fileDesc>
<profileDesc>
<textClass></textClass>
<langUsage>
<language ident="en">en</language>
</langUsage>
</profileDesc>
</teiHeader>
<front>
<div type="abstract" xml:lang="en">Abstract: The overarching goal of the Informedia Digital Video Library project has been to achieve machine understanding of video media, including all aspects of search, retrieval, visualization and summarization in both contemporaneous and archival content collections. The base technology developed by the Informedia project combines speech, image and natural language understanding to automatically transcribe, segment and index broadcast video for intelligent search and image retrieval. While speech processing has been the most influential component in the success of the Informedia project, other modalities can be critical in various situations. Evaluations done in the context of the TRECVID benchmarks show that while some progress has been made, there is still a lot of work ahead. The fundamental “semantic gap” still exists, but there are a number of promising approaches to bridging it.</div>
</front>
</TEI>
<affiliations>
<list>
<country>
<li>États-Unis</li>
</country>
<region>
<li>Pennsylvanie</li>
</region>
<settlement>
<li>Pittsburgh</li>
</settlement>
<orgName>
<li>Université Carnegie-Mellon</li>
</orgName>
</list>
<tree>
<country name="États-Unis">
<region name="Pennsylvanie">
<name sortKey="Hauptmann, G" sort="Hauptmann, G" uniqKey="Hauptmann G" first="G." last="Hauptmann">G. Hauptmann</name>
</region>
<name sortKey="Hauptmann, G" sort="Hauptmann, G" uniqKey="Hauptmann G" first="G." last="Hauptmann">G. Hauptmann</name>
</country>
</tree>
</affiliations>
</record>

To manipulate this document under Unix (Dilib)

EXPLOR_STEP=$WICRI_ROOT/Ticri/CIDE/explor/OcrV1/Data/Main/Exploration
HfdSelect -h $EXPLOR_STEP/biblio.hfd -nk 001283 | SxmlIndent | more

Or

HfdSelect -h $EXPLOR_AREA/Data/Main/Exploration/biblio.hfd -nk 001283 | SxmlIndent | more
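
Where the Dilib tools are not available, the record can also be read with a generic XML parser. The following is a minimal sketch in Python, not part of Dilib. It assumes the record has been copied into a string. The record uses the `wicri:` prefix without declaring it, so the sketch wraps the record in a hypothetical `<wrapper>` element that binds the prefix to a placeholder URI (`urn:example:wicri`, an assumption made only so that the parser accepts the input). The function names `parse_record` and `summarize` are illustrative.

```python
import xml.etree.ElementTree as ET

# Hypothetical placeholder URI: the record uses the "wicri:" prefix without
# declaring it, so a namespace-aware parser needs some binding for it.
WICRI_NS = "urn:example:wicri"


def parse_record(record_xml: str) -> ET.Element:
    """Parse a <record> fragment, binding the undeclared wicri: prefix."""
    wrapped = f'<wrapper xmlns:wicri="{WICRI_NS}">{record_xml}</wrapper>'
    return ET.fromstring(wrapped).find("record")


def summarize(record: ET.Element) -> dict:
    """Collect a few bibliographic fields from the TEI header."""
    title = record.find(".//titleStmt/title")
    doi = record.find(".//publicationStmt/idno[@type='doi']")
    authors = [n.text for n in record.findall(".//titleStmt/author/name")]
    return {
        "title": title.text if title is not None else None,
        "doi": doi.text if doi is not None else None,
        "authors": authors,
    }


# Abridged copy of the record shown above, for illustration only.
SAMPLE = """<record>
<TEI wicri:istexFullTextTei="biblStruct">
<teiHeader><fileDesc>
<titleStmt>
<title xml:lang="en">Lessons for the Future from a Decade of Informedia Video Analysis Research</title>
<author><name sortKey="Hauptmann, G">G. Hauptmann</name></author>
</titleStmt>
<publicationStmt>
<idno type="wicri:source">ISTEX</idno>
<idno type="doi">10.1007/11526346_1</idno>
</publicationStmt>
</fileDesc></teiHeader>
</TEI>
</record>"""

if __name__ == "__main__":
    print(summarize(parse_record(SAMPLE)))
```

The wrapper element is a workaround for the missing namespace declaration. If the real Wicri namespace URI is known, it can be substituted for the placeholder without changing the rest of the sketch.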

To link to this page within the Wicri network

{{Explor lien
   |wiki=    Ticri/CIDE
   |area=    OcrV1
   |flux=    Main
   |étape=   Exploration
   |type=    RBID
   |clé=     ISTEX:8EAEE8462A4CFC7C03932537AF5AAD784FEE76B5
   |texte=   Lessons for the Future from a Decade of Informedia Video Analysis Research
}}

Wicri

This area was generated with Dilib version V0.6.32.
Data generation: Sat Nov 11 16:53:45 2017. Site generation: Mon Mar 11 23:15:16 2024